Preprocessors for Noisy Speech
نویسنده
چکیده
Objectives: The objective of this project is to develop a preprocessor for speech recognition systems operating in noisy environments. The preprocessor, consisting of a nonlinear inhomogeneous transmission line, will be realized in software, although realization in hardware in FYgl should be possible. More specifically we will: 1) Develop a nonlinear transmission line preprocessor that accurately simulates the mechanics of the mammalian inner ear at all sound pressure levels. 2) Preprocess speech with the nonlinear transmission line and show that there is a substantial improvement in the signal to noise ratio. 3) Assess the desirability and feasibility of implimenting either a digital or analog transmission line on a chip and using it as a preprocessor in the CMU, BBN, or MIT DARPA funded speech recognition systems.
منابع مشابه
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملA Preprocessor for Speech Recognition Systems Operating in Noisy Environments
Objectives: The recognition of speech in noisy environments is critical to certain DoD systems now under development. Current preprocessors for speech recognition systems, such as those based on "linear predictive coding," are linear and therefore not effective in noisy environments. The objective of this project is to develop a nonlinear preprocessor for speech recognition systems that signifi...
متن کاملAuditory Speech Preprocessors
• Speech preprocessors are important. Small improvements at the beginning of the recognition process can lead to substantial improvements by the end. Resolving acoustic ambiguities decreases the number of possibilities that must resolved by higher level linguistic processing. • The past: Much has been learned about speech preproeessing from the inner ear of vertebrates. Historically, this appro...
متن کاملOptimized estimation of spectral parameters for the coding of noisy speech
In this contribution we optimize a speech enhancement preprocessor such that a distortion measure in the Line Spectral Frequency (LSF) domain is minimized. We can thus improve the estimation of spectral parameters of a speech coder when the input signal to the coder is a noisy speech signal. The optimization aims at the maximum noise reduction of the enhancement preprocessor. The average maximu...
متن کاملAn Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1989